Improving Video Retrieval Using Multilingual Knowledge Transfer
Abstract
Video retrieval has seen tremendous progress with the development of vision-language models. However, further improving these models requires additional labelled data, which is a huge manual effort. In this paper, we propose a framework, MKTVR, that utilizes knowledge transfer from a multilingual model to boost the performance of video retrieval. We first use state-of-the-art machine translation models to construct pseudo ground-truth multilingual video-text pairs. We then use this data to learn a video-text representation where English and non-English text queries are represented in a common embedding space based on pretrained multilingual models. We evaluate our proposed approach on four datasets: MSRVTT, MSVD, DiDeMo, and Charades. Experimental results demonstrate that our approach achieves state-of-the-art results on all datasets, outperforming previous models. Finally, we also evaluate our model on a multilingual video-retrieval dataset encompassing six languages and show that it outperforms previous multilingual video retrieval models in a zero-shot setting.
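The retrieval step the abstract describes, ranking videos against a text query in a shared embedding space, can be illustrated with a minimal sketch. This is not the MKTVR implementation: the embeddings are hand-made toy vectors rather than model outputs, and the names `cosine`, `rank_videos`, and `recall_at_k` are hypothetical helpers chosen for illustration. The one idea it borrows from the abstract is that an English caption and its machine-translated counterpart are both paired with the same video.

```python
import math
from typing import Dict, List, Sequence, Tuple


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def rank_videos(query: Sequence[float],
                videos: Dict[str, Sequence[float]]) -> List[str]:
    """Return video ids sorted from most to least similar to the query."""
    return sorted(videos, key=lambda vid: cosine(query, videos[vid]), reverse=True)


def recall_at_k(queries: List[Tuple[Sequence[float], str]],
                videos: Dict[str, Sequence[float]],
                k: int) -> float:
    """Fraction of queries whose ground-truth video appears in the top k."""
    hits = sum(1 for emb, gold in queries if gold in rank_videos(emb, videos)[:k])
    return hits / len(queries)


# Toy video embeddings (assumed values, for illustration only).
video_embeddings = {
    "cooking": [0.9, 0.1, 0.0],
    "soccer": [0.1, 0.9, 0.1],
    "guitar": [0.0, 0.2, 0.9],
}

# Each entry is (text embedding, ground-truth video id). The first two
# stand in for an English caption and its translated pseudo pair, both
# mapped near the same video in the common space.
text_queries = [
    ([0.8, 0.2, 0.1], "cooking"),    # English caption (toy vector)
    ([0.85, 0.1, 0.05], "cooking"),  # translated caption (toy vector)
    ([0.2, 0.8, 0.0], "soccer"),
    ([0.1, 0.1, 0.95], "guitar"),
]

if __name__ == "__main__":
    print(rank_videos(text_queries[0][0], video_embeddings))
    print(recall_at_k(text_queries, video_embeddings, k=1))
```

In a real system the vectors would come from trained video and multilingual text encoders, and Recall@K would be computed over a full test set; the ranking logic itself would stay the same.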
Similar resources
Improving Deliverable Speech-to-Text Systems with Multilingual Knowledge Transfer
This paper reports our recent progress on using multilingual data for improving speech-to-text (STT) systems that can be easily delivered. We continued the work BBN conducted on the use of multilingual data for improving Babel evaluation systems, but focused on training time-delay neural network (TDNN) based chain models. As done for the Babel evaluations, we used multilingual data in two ways:...
Knowledge Transfer: Revisiting Video
Knowledge transfer has been an important issue for organizational knowledge management programs. This article reviews the plethora of user-generated video activity and the issues it creates for knowledge management activities. Video’s media richness combined with its ability to convey rich narratives can facilitate sensemaking and learning. However, structure and culture are important factors t...
Using Knowledge Representation Languages for Video Annotation and Retrieval
Effective usage of multimedia digital libraries has to deal with the problem of building efficient content annotation and retrieval tools. In particular in video domain, different techniques for manual and automatic annotation and retrieval have been proposed. Despite the existence of well-defined and extensive standards for video content description, such as MPEG-7, these languages are not exp...
Improving Question Retrieval in Community Question Answering Using World Knowledge
Community question answering (cQA), which provides a platform for people with diverse backgrounds to share information and knowledge, has become an increasingly popular research topic. In this paper, we focus on the task of question retrieval. The key problem of question retrieval is to measure the similarity between the queried questions and the historical questions which have been solved by ot...
Improving Retrieval Performance using World-Knowledge Generated Features
Information Retrieval is the task of retrieving information items (documents, images, videos etc.) most relevant to a given user query. The common approach in textual IR systems is to index and retrieve documents by selecting representative key words and phrases within them, using various statistical, linguistic and semantic methods, and viewing each document as a vector in the vector space def...
Journal
Journal title: Lecture Notes in Computer Science
Year: 2023
ISSN: 1611-3349, 0302-9743
DOI: https://doi.org/10.1007/978-3-031-28244-7_42